Relational Data Mining and ILP for Document Image Understanding

نویسندگان

  • Michelangelo Ceci
  • Margherita Berardi
  • Donato Malerba
چکیده

42.86%19.05%0.15%0.91%Affiliation59.09%13.64%0.40%0.37%Author44.00%40.00%0.09%0.40%Biography33.33%28.57%0.03%0.24%Caption74.86%30.60%1.83%14.95%Figure14.33%4.78%1.31%3.47%Formulae40.67%12.23%5.45%12.81%Index_term90.91%45.45%0.03%0.46%Page_number7.22%0.56%0.42%0.35% Paragraph–3.62%–22.39%References72.50%30.00%0.52%3.19%Running_head12.32%5.42%0.16%4.58%Section_title66.13%33.87%0.74%4.72%Subsection_title100.00%60.00%0.06%3.14%Table76.09%34.78%0.68%4.06%Title43.48%26.09%0.27%0.95% Average number of omission errors over positive examples, commission errors over negativeexamples. Mr-SBC results are obtained with n 1⁄4 2 and CostRatio 1⁄4 10. The best results are in bold.336M. Ceci et al.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relational Data Mining Techniques for Historical Document Processing

Document image understanding denotes the recognition of semantically relevant components in the layout extracted from a document image. Automatic approaches for document image understanding are highly demanded today by organizations involved in the preservation and valorisation of historical documents that collect more and more document images, whose effective usage critically depends on their ...

متن کامل

Rule Learning from Semi-structured Documents by Inductive Logic Programming

One of the hot research areas is knowledge discovery on structured documents like HTML and XML documents. In the case of XML documents, most popular approach to mining a knowledge is structural approach which find some kind of similar pattern(often tree structure or XPath) in interested XML documents. On the other hand, there is relational data mining approach such as ILP(Inductive Logic Progra...

متن کامل

Discovering Knowledge through Multi-modal Association Rule Mining for Document Image Analysis

The paper introduces a descriptive data mining method to discover knowledge for the task of automatic categorization in document image analysis. We argue that a document image is a multi-modal unit of analysis whose semantics is deduced from a combination of textual content, layout structure and logical structure. So, the method considers simultaneously different modalities of document represen...

متن کامل

FOIL-D: Efficiently Scaling FOIL for Multi-relational Data Mining of Large Datasets

Multi-relational rule mining is important for knowledge discovery in relational databases as it allows for discovery of patterns involving multiple relational tables. Inductive logic programming (ILP) techniques have had considerable success on a variety of multi-relational rule mining tasks, however, most ILP systems do not scale to very large datasets. In this paper we present two extensions ...

متن کامل

Relational Learning

Most of the content-based approaches to text and web document classification explored in other related projects are based on the bag of words model, well known from the area of Information Retrieval. This model is simple and efficient, but fails to capture many additional document features such as the internal HTML structure, language structure and inter-document link structure. All this howeve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Applied Artificial Intelligence

دوره 21  شماره 

صفحات  -

تاریخ انتشار 2007